{
"metadata": {
"name": "Module 3 DESeq"
},
"nbformat": 3,
"nbformat_minor": 0,
"worksheets": [
{
"cells": [
{
"cell_type": "code",
"collapsed": false,
"input": "#having issues installing r to python coding. Instead I will be doing everything through R studio. \n#Using the Gonad Transcriptome from Mac, I ran an RNA-Seq analysis using CLC following the instructions provided from Steven's video.\n#Quick Run down. Import HTPsequence data to CLC. Run RNA-Seq Through CLC Server on two transcriptomes.\n#Set Up Experiment, compare two transcriptomes (gill, gonad) Export with labelled data to CSV file. \n#The compilation took several days but I was able to get a viable file set from it. \n#I exported the CSV file to Eagle into my bioinformatics file. From there I have downloaded them locally to use in R. \n#In the mean time I completed the trial script program Steven provided in Module 3. I will be modifying the coding in this script,\n#using R Studio to edit coding and run it. ",
"language": "python",
"metadata": {},
"outputs": [],
"prompt_number": 1
},
{
"cell_type": "code",
"collapsed": false,
"input": "cd C:\\Users\\Christine Savolainen\\Desktop\\Bio Informatics\\DESeq\\DESeq",
"language": "python",
"metadata": {},
"outputs": [
{
"output_type": "stream",
"stream": "stdout",
"text": "C:\\Users\\Christine Savolainen\\Desktop\\Bio Informatics\\DESeq\\DESeq\n"
}
],
"prompt_number": 4
},
{
"cell_type": "code",
"collapsed": false,
"input": "!head -2 GonadvGill.txt",
"language": "python",
"metadata": {},
"outputs": [
{
"output_type": "stream",
"stream": "stderr",
"text": "head: cannot open `GonadvGill.txt' for reading: No such file or directory\n"
}
],
"prompt_number": 5
},
{
"cell_type": "code",
"collapsed": false,
"input": "#using the following R code, converted CSV to TXT. \n#govgi = read.csv(\"GonadvsGill.csv\")\n#write.table(govgi,\"govgi.txt\", sep=\"\\t\")",
"language": "python",
"metadata": {},
"outputs": [],
"prompt_number": 7
},
{
"cell_type": "code",
"collapsed": false,
"input": "!head -2 govsgi.txt",
"language": "python",
"metadata": {},
"outputs": [
{
"output_type": "stream",
"stream": "stdout",
"text": "\"Feature.ID\"\t\"Range..original.values.\"\t\"IQR..original.values.\"\t\"Difference..original.values.\"\t\"Fold.Change..original.values.\"\t\"Expression.values\"\t\"Gene.length\"\t\"Unique.gene.reads\"\t\"Total.gene.reads\"\t\"RPKM\"\t\"Means\"\t\"Expression.values.1\"\t\"Gene.length.1\"\t\"Unique.gene.reads.1\"\t\"Total.gene.reads.1\"\t\"RPKM.1\"\t\"Means.1\"\n\"1\"\t\"CGI_10000001\"\t\"190\"\t\"190\"\t\"190\"\t\"3.346\"\t\"81\"\t\"351\"\t\"0\"\t\"81\"\t\"12.668\"\t\"81\"\t\"271\"\t\"351\"\t\"0\"\t\"271\"\t\"17.384\"\t\"271\"\n"
}
],
"prompt_number": 5
},
{
"cell_type": "code",
"collapsed": false,
"input": "!dos2unix < govsgi.txt > govsgiunix.txt",
"language": "python",
"metadata": {},
"outputs": [
{
"output_type": "stream",
"stream": "stderr",
"text": "'dos2unix' is not recognized as an internal or external command,\noperable program or batch file.\n"
}
],
"prompt_number": 7
},
{
"cell_type": "code",
"collapsed": false,
"input": "!sed 's/\\\"/ /g' -input govsgi.txt -output govsgi2.txt",
"language": "python",
"metadata": {},
"outputs": [
{
"output_type": "stream",
"stream": "stderr",
"text": "sed: unknown option -- o\nUsage: sed [OPTION]... {script-only-if-no-other-script} [input-file]...\n\n -n, --quiet, --silent\n suppress automatic printing of pattern space\n -e script, --expression=script\n add the script to the commands to be executed\n -f script-file, --file=script-file\n add the contents of script-file to the commands to be executed\n --follow-symlinks\n follow symlinks when processing in place\n -i[SUFFIX], --in-place[=SUFFIX]\n edit files in place (makes backup if SUFFIX supplied)\n -b, --binary\n open files in binary mode (CR+LFs are not processed specially)\n -l N, --line-length=N\n specify the desired line-wrap length for the `l' command\n --posix\n disable all GNU extensions.\n -r, --regexp-extended\n use extended regular expressions in the script.\n -s, --separate\n consider files as separate rather than as a single continuous\n long stream.\n -u, --unbuffered\n load minimal amounts of data from the input files and flush\n the output buffers more often\n -z, --null-data\n separate lines by NUL characters\n --help display this help and exit\n --version output version information and exit\n\nIf no -e, --expression, -f, or --file option is given, then the first\nnon-option argument is taken as the sed script to interpret. All\nremaining arguments are names of input files; if no input files are\nspecified, then the standard input is read.\n\nGNU sed home page: